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ABSTRACT 

We describe a model of dark matter halo abundances and clustering which com- 
bines the two most widely used approaches to this problem: that based on peaks 
and the other based on excursion sets. Our approach can be thought of as ad- 
dressing the cloud- in-cloud problem for peaks and/or modifying the excursion 
set approach so that it averages over a special subset, rather than all possible 
walks. In this respect, it seeks to account for correlations between steps in the 
walk as well as correlations between walks. We first show how the excursion 
set and peaks models can be written in the same formalism, and then use this 
correspondence to write our combined excursion set peaks model. We then give 
simple expressions for the mass function and bias, showing that even the linear 
halo bias factor is predicted to be fc-dependent as a consequence of the nonlo- 
cality associated with the peak constraint. At large masses, our model has little 
or no need to rescale the variable S c from the value associated with spherical 
collapse, and suggests a simple explanation for why the linear halo bias factor 
appears to lie above that based on the peak-background split at high masses 
when such a rescaling is assumed. Although we have concentrated on peaks, our 
analysis is more generally applicable to other traditionally single-scale analyses 
of large-scale structure. 

Key words: large-scale structure of Universe 



1 INTRODUCTION 

IPress fc Schechterl l| 19741 ) argued that the abundance of 



nonlinear virialized objects at late times (such as the 
present) should be sensitive to the statistics of the initial 
fluctuation field, and to the subsequent expansion history 
of the universe. This is the basis for studies which seek 
to use the abundance and clustering of galaxy clusters to 
constrain cosmological parameters. 

Their work has motivated the study of analytical 
models for the formation, and hence the abundance and 
spatial distribution, of halos, which can be used to pro- 
vide fitting formulae w hen interpreting data. Following 
ISheth fc Tormenl(|l999l ). the most widely used fitting for- 
mulae are self-similar, in the sense that the predicted halo 
abundances can be scaled to a universal form which is 
independent of cosmology, redshift and power spectrum. 
This vastly simplifies cosmological analyses. (This uni- 
versality is only expected to hold approximately, and the 
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next generation of datasets may have sufficiently many 
clusters that departures from universality must be ac- 
counted for. We will have more to say about this later.) 

The self-similar functional form can be de- 
rived from a physically motiv ated model of col- 
lapse (|Sheth. Mo fc Tormenl . l200ll '). The number density 
An/ Am of halos in the mass range (m, m + dm) is written 
as 



m An(m) 
p Am 



Am = f{y) Av, 



(1) 



where p is the background density and v = 8 c /a, with 
5 C the rescaled time variable and a the rescaled mass 
variable (<r 2 (m) = (<5 2 (m)) is the variance of the mat- 
ter density field smoothed on a Lagrangian length scale 
corresponding to mass m and linearly extrapolated to 
present day). Universality is manifest in the statement 
that / depends only on v, but, unfortunately, the most 
naive use of this form predicts too few massive clusters. 
This has motivated the following ad-hoc approach: one 
actually fits f(y/qv) to the data, and determines q from 
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the fit. This semi-empirical approach has worked rather 
well, in the sense that q appears to be approximately 
independent of cosmology, redshift and power spectrum, 
although recent simulatio ns are beginning t o show depar- 
tures from universality (|Bagla et all I2009T ). 

Since observations will soon deliver large cluster cat- 
alogs over a range of redshifts, it is clearly desirable to 
have a more fundamental understanding of why q 7^ 1, 
particularly because, on an object by object basis, the 
physical model of collapse almost never has S < 5 C . I.e., 
q < 1 appears to arise in the step which converts from 
the physics of halo formation to a stat istical description 
of halo abundances ( Sheth et al.l . l200lT l. One of the main 
goals of this paper is to provide some insight into the 
origin of this factor. 

To do so, we revisit the two most common models 
for identifying h alos from the in i tial fl uctuation field: the 
peaks theory of iBardeen et all ( 19861. hereaft er BBKS), 
and the excursion set approach of lBond et al.l l|199ll ) . Al- 
though both make predictions which can be phrased in 
terms of the self-similar variable u, and both treat v as the 
ratio of 5 c /cr, the former treats the numerator of this ra- 
tio as the stochastic quantity, whereas for the latter, it is 
the denominator which can vary. They also differ funda- 
mentally in their approach to the problem. Peaks theory 
seeks to describe the point process which describes the 
special positions in the initial conditions around which 
halos collapse. The excursion set approach aims only at 
a statistical description of the mass fraction in bound ob- 
jects, and assumes that this can be done by consideration 
of all points in space - not just the special ones around 
which halos form. Our analysis below shows how to merge 
the two descriptions. 

Section [2] shows that the excursion set and peaks 
models can be written in the same formalism, and then 
describes our excursion set model for peaks, arguing that 
the result goes a substantial way towards explaining the 
origin of the factor of a. Section[3]extends this to describe 
the conditional function of excursion set peaks in con- 
strained larger-scale environments, and from there builds 
a model for t he large scale bias factors. This us es the re- 
cent work of l|Musso. Paranjape fc Shethll20l3 . hereafter 
MPS) to show that halo bias in our approach is generi- 
cally expected to be k dependent. It also shows that at 
high masses, our new expression for halo bias is qualita- 
tively similar to that seen in simulations, again suggest- 
ing that our excursion set peaks model of the origin of 
a 7^ 1 is reasonable. A final section summarizes our re- 
sults, discusses them in the context of previous work on 
the relationship between excursion sets and peaks, and 
suggests ways in which our approach could be improved 
further. 



2 THE UNCONDITIONAL MASS 
FUNCTION 

This section develops what we call the excursion set 
model for peaks: it is both a peaks model which deals with 



different smoothing scales, and an excursion set model 
which deals with the statistics of special rather than ran- 
dom positions. Th is is particularly interesting because 
ISheth et al.l (|200lh have argued that the latter is a nec- 
essa ry change to the standar d excursion set approach, 
and lLudlow fc PorcianJ l|201lF ) have argued that the cor- 
respondence between peaks in the initial conditions and 
halos at late times in their simulations is quite good. 

2.1 Notation 

Let s(R) denote the variance of the (linearly extrap- 
olated) density contrast 5 smoothed on a Lagrangian 
length scale R. Then s = 0% where 



2 

a j = 



A A k 
W) 



3 P{k) k 2j W 2 (kR) 



(2) 



and our notation has dropped the explicit dependence of 
<Tj on R, where we think no confusion will arise. Here 
P(k) is the power spectrum of the field, and W(kR) is 
the Fourier transform of the smoothing filter. For what 
follows, it is also convenient to define 



R* = v3<ti/ct2 



and 



7 = ai/cro<J2 . 



(3) 



Unless stated otherwise, we will always consider Gaussian 
smoothing filters, for which W(kR) = exp(— k 2 R 2 /2), 
and, for the P(k) of current interest in cosmology, all 
the integrals above converge. 

We return to the issue of smoothing filter in the Dis- 
cussion section, because the analysis which follows ex- 
ploits the following property which is special to Gaussian 
smoothing. Namely, the Laplacian of the field on a given 
smoothing scale Vx<5(-R,x), which is a quantity of fun- 
damental importance in peaks theory, is the same as the 
derivative of the field with respect to smoothing scale, a 
fundamental quantity in excursion set theory. 

On dimensional grounds, the volume associated with 
a smoothing filter is V oc R 3 ; for a Gaussian filter 
V = (2nR 2 ) 3 ' 2 . It is natural to associate a mass with 
the smoothing scale R: m = pV, where p is the comov- 
ing background density. We will only consider hierarchi- 
cal models in which the fluctuations in the initial field 
were small. The former means that oo is a monotoni- 
cally decreasing function of R, and the latter that R, 
m and 00 are equivalent variables. In what follows we 
illustrate our results using P(k) for a flat ACDM cos- 
mological model with parameters (fi m , Qa, h, as, n s ) = 
(0.25,0.75,0.7,0.8,0.95). 

2.2 The excursion set approach 

The excursion set approach assumes that the mass frac- 
tion associated with bound halos of mass m at any given 
time t equals the volume fraction of positions in the 
initial fluctuation field which, when smoothed on scale 
R oc m 1//3 , have overdensity S(R) = S c (t) and, for all 
R' > R, have S(R') < 5 c (t). This latter constraint is dif- 
ficult to handle because it implies an infinite number of 
constraints (one for each smoothing scale). 
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iMusso fc Shethl (|2012l . hereafter MS) showed that 
the much simpler requirements that S(R) > 8 C and 
8(R + Ai?) < 5 C for Ai? < 1 (i.e., just one additional 
constraint) permit a simple analytic estimate of this frac- 
tion which is remarkably accurate. Although this fraction 
depends on the power spectrum of the underlying fluc- 
tuation field, much of this dependence can be removed 
if one works instead with the requirement S(s) > S c and 
5(s — As) < S c (recall that s and R are equivalent vari- 
ables). Thus, if v = dd/ds, then MS argued that the 
fraction /(s) of interest satisfies 

poo rS c +vAs 

Asf(s)= / dv / dSp(v,S), (4) 



where p(v, 8) is the joint distribution of 8 and its deriva- 
tive v = d8/ds. Taking the limit As — > ds <C 1 implies 



/(s) = / dvvp(v,S c ) , 
Jo 



(5) 



We will see shortly that the issue is what exactly to use 
for p(y, 8). 

The rms values of 8 and v are ao and (2 r yo~o)~ 1 , re- 
spectively. So, if we define 



Sc/<Jo 



and 



x = 2yao v, 



(6) 



(our choice of notation will become clear shortly) and 
if we choose to average over all positions in the initial 
Gaussian random field which have height 8 C on scale s, 
then equation ([5]) implies that 



exp(-i/ 2 /2) 



dxxpG(x — 7^; 1 — 7 ) (7) 



27V2tt 

where pa (y — Mi s ) a Gaussian distribution for variable 
y, with mean m and variance s. The right hand side is 
clearly a function of the scaling variable v and 7. The de- 
pendence on 7 means that the result is not a completely 
universal function of v, but MS argued that, over the 
range of power spectra of current interest in cosmology, 
this dependence is relatively weak. Therefore, it is useful 
to use the fact that vf(v) = s/(s) |dlns/dln^j = 2s/(s) 
to write the expression above in terms of the scaling vari- 
able v: 



vfyis (f) 



v exp(-^ 2 / 2 ) H7,7^)t, 



71/ 



(8) 



where 



(x|7,z»)i> 



dxxpaix — x t ; 1 — 7 ). (9) 



This shows that f(u) is the product of the Gaussian prob- 
ability of having height v and the mean 'curvature' as- 
sociated with such positions. (This integral can be done 
analytically; see MS. Also, although we will not focus on 
the expected departures from universality in this model, 
which come from the dependence on 7, we would like to 
emphasize that small departures are predicted.) 

MS showed that this expression, based on their 'one- 
step' approximation, provided an excellent description of 
the exact solution in which the constraint on the walk 
height is satisfied on all scales. While this is significant 
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Figure 1. Mass funct ion of halos iden t ified in a ACDM 
simulation (solid red; ISheth fe Tormenl . [l999, with their 
= 0.707) and as pr edicted by the excursion set approach 
|musso fc Shethl . 120121) (dotted black) and with v -> \/07f07u 
(dot-dashed blue) in equation JS}. This plot uses TopHat fil- 
tering of the ACDM power spectrum. 



- it effectively solves t he same excursion set problem for 
'correlated' steps that lBond et al.l l|l99ll ) solved for 'un- 
corrected' steps - it does not really solve the problem 
as stated at the beginning of this sub-section. Namely, 
the quantity of interest is a volume fraction in the ini- 
tial field. One should estimate this by averaging over the 
full set of walks in one realization of the field. But what 
has actually been calculated is an ergodic average over 
an ensemble of walks in which the steps in each walk are 
correlated, b ut the walks thems elves are independent of 
one another. ISheth et al.l 1 200ll i demonstrated that this 
replacement is incorrect; in fact, one must either account 
for the correlations between walks, or account for the 
fact that the set of walks over which one should aver- 
age is a special subset of all wa lks (also see IShethl l201ll ; 
IParaniape. Lam fc Shethl l2012h . 

Figure [1] provides one illustration of why the dis- 
tinction matters. (For this plot only, we have used a 
Tophat, rather than Gaussian, filter in all integrals over 
P(k).) The solid curv e shows the fitting function of 
ISheth fc Tormenl (|l999l ) 



vfsriv) = 0.644 \l + {qv 2 y 



qv 2 exp(—qv 2 /2) 




which provides a good description of halo counts in sim- 
ulations. As mentioned in the Introduction, this function 
is expressed in terms of the scaling variable yfqv, where v 
is the same quantity which enters in the excursion set ap- 
proach and q ~ 0.7. The dotted curve shows equation (J8j ; 
it vastly underestimates the halo counts at large v. 
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It is easy to see why this happens. At v 2 3> 1, 
v fsT(v) — > 0.644^/ qv 2 exp(— (j , i/ 2 /2)\/27r whereas equa- 
tion iJSJ becomes v fyis(y) — > 1/ exp(— u 2 /2) /\/2tt. This 
shows that if we rescale f — > y^gf in equation (|8j) , then 
the result should provide a good description of the halo 
counts, upto the difference in amplitude of 0.644. How- 
ever, even this difference in amplitude can be accounted 
for by noting that the term in square brackets in equa- 
tion (110[) only equals unity at very large v. At v 2 = 10, it 
is « 3/2, so multiplying by 0.644 yields unity. The dot- 
dashed curve shows that, indeed, rescaling v — » VO.707^ 
in equation ([8} works very well. 

Note that q < 1 is required to achieve this agreement 
at large v. While it is tempting to associate this rescal- 
ing with a reduction in the value of 8 C , this is problem- 
atic because direct measurements of the overdensity in 
patches which are destined to form halos show that the 
critical density required for collapse increases at small 
v. This increase is qualitatively consistent with expecta- 
tions based on modelling halo f ormation using a t riaxial 
rather than spherical collapse (|Sheth et all l200lf ). I.e., 
the physics of halo formation suggests that, if we want 
to think of the parameter q as rescaling S c , then q should 
be greater, rather than less than unity. The alternative, 
w hich we will not explore further here (but see discussion 
in lParaniape et alll2012t ). is to assume that y/q rescales a 
rather than S c . Rather, the next section explores a quite 
different reason for why rescaling with q < 1 works so 
well. The main point we wish to make here is that, ab- 
sent an understanding of why rescaling v was necessary, 
one should not use the shape of f(v) to make conclusions 
about whether the physics of collapse was spherical or 
not; the evidence for triaxial collapse comes from the di- 
rect measurements of the propert ies of the patches from 
which halos formed (i.e., those in lSheth et all feoOll ). 



2.3 Peaks in the initial field 

In the argument which led to equation we noted that 
the predicted mass fraction f(y) depends critically on 
what one chooses for p(v\S c ). In what follows, we will 
show what happens if we wish to add the additional con- 
straint that, on scale s, 5 — S c is also a local maximum 
of the field. 

The number density of peaks of height 5 depends 
critically on the smoothing scale on which the peaks were 
defined. If ao denotes the rms value of the density fluc- 
tuation on the chosen smoothing scale, then the number 
density of peaks of scaled height v — 5/ao in a Gaussian- 
smoothed Gaussian random field is 

A/- pl » = / dxAf pk (x,v) = ^ (2° ( ff/2 . (") 
where 7 and R* were defined earlier, and 

r 00 

Gj{-y,x,) = / dxx J F(x)p G {x-x*;l-^), (12) 
Jo 



with 



F(x) = 




(equations A14-A19 in BBKS). The variable x is the 
Laplacian of the field normalized by its rms value (for 
Gaussian filters, this rms is 02), so it represents the cur- 
vature around the peak position. Therefore, F(x) quan- 
tifies how different the set of curvatures is around a peak 
position compared to a randomly placed one. 

To map from peak number densities to halo mass 
fractions, one must associate a mass with each peak. 
The natural choice is the mass m contained within the 
smoothing window: m — pV with V oc R 3 . But this 
has the unfortunate consequence that peaks of different 
height v will all have the same mass, whereas the intu- 
itive expectation is that more massive objects should be 
associated with higher peaks. 

I.e., the intuitive picture is one in which there is a 
critical density contrast <5 C which is associated with halos 
(in what follows we will set S c — 1.686, thus ignoring the 
mild dependence on cosmology predicted by the spherical 
collapse model), and massive halos have large v = 5 c /ao 
because they have small ao (this is what happens natu- 
rally in the excursion set approach). Thus, the main diffi- 
culty in identifying peaks with halos is that equation (|ll|l 
is defined for a fixed smoothing scale R (so changes in v 
are due to changes in 5) , whereas one would really like to 
allow R to vary instead. 

If one assumes naively (and incorrectly) that v in 
equation (lll[l has S c fixed and R varying, then one might 
naively (and incorrectly) assume that the mass fraction 
of the Universe that is in peaks of mass m is given by 

m e -" 2/2 V 

/bbksH = — AA pk (^) = —^—00(7,7^), (14) 

where we have defined V* = (2nR 2 ^ 3 ^ 2 . Quite apart from 
the mathematical inconsistency associated with making 
this assumption, there is a conceptual difficulty which is 
known as the cloud-in-cloud problem. This comes from 
considering how the density around a given position fluc- 
tuates as one changes the smoothing scale R. One might 
imagine that a given position is a peak on some smooth- 
ing scales and not on others; or that a position which is 
a local maximum of height v on a small smoothing scale 
may have an even larger value of v on a large smooth- 
ing scale, without being a local maximum of the field on 
the larger smoothing scale. Which, if any of these cases, 
should one associate with halos? 

2.4 Excursion set peaks 

As MS noted, the excursion set approach of the previous 
section shows how one might address this problem more 
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consistently. Namely, it says that of the peaks present 
on scale s, we want those which have a smaller height on 
the next larger smoothing scale. Therefore, the same logic 
which led to equation © (i.e., demanding that the scaled 
peak height lie between v = S c /ao and v + (31/27) A Ins) 
will now yield 



WespM = — / dxxAT pk (v,x) . 
l v Jo 



(15) 



making 



/ B s P M = ^^Go (7l7 ,)^^sP, (16) 



2^ V, 



~1 V 



where 



(a;|7,x*) ESP = Gi(7,z„)/Go(7,z.)- 



(17) 



(This is where the choice of a Gaussian for the smooth- 
ing filter simplifies the analysis, since a constraint on the 
value of the derivative with respect to smoothing scale 
becomes a constraint on the curvature of the field. This 
also explains why, in equation ((6| we used x to denote 
dS/ds normalized by its rms value.) 

Essentially this same formula fo r peaks, equa- 
tion ()16|) . has appeared previously (|Appel fc Jonesl . 
1990). However, we believe our treatment highlights the 
similarities and differences between peaks and random 
positions more clearly. In particular, notice that /bsp 
modifies the peaks probability in the same way that /ms 
modifies the gaussian pdf: the distribution of peaks picks 
up an additional factor of the normalised mean peak cur- 
vature. In this respect, the only conceptual difference be- 
tween /esp and /ms is that the latter averages over 'ran- 
dom' positions in the field, whereas the former averages 
over 'special' ones. In other words, /esp addresses both 
the cloud-in-cloud problem for peaks (the fundamental 
failing of the peaks approach), and the question of how 
the excursion set predictions are modified if one averaged 
over special positions in the initial field (the fundamental 
failing of the excursion set approach). 

The analysis above shows that changing the ensem- 
ble over which the excursion set average is computed has 
a dramatic effect. To see this explicitly note that, at large 
v > 1, Go — ► 7 3 (^ 3 — 3f) and (x|7,7^) Egp — ► -yv. This 
makes 



/espH 



! /2 



(is 3 - 3u) 



V-y 3 
V. 



/ms(^) 



V-y 3 u 3 



(18) 

For P(k) oc k n , 7 3 V/V» = [(n + 3)/6] 3/2 is just a constant 
independent of m, so /esp//ms — > W^iji + 3)/6] 3//2 ; this 
grows rapidly at large v. We will return to this shortly. 

Figure [2] compares these different models for halos 
using Gaussian filtering of the same ACDM spectrum as 
before. The curves show the results for BBKS (dashed 
blue), MS (dotted black) and ESP (solid red). The dashed 
blue curve is anecdotal, since, as we argued, it is not 
well motivated. We have included it to illustrate that the 
difference between it and the more careful calculation 
(ESP) is small at large masses. 

The real interest in this plot is the fact that, at large 
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Figure 2. Mass functions with Gaussian filtering of a ACDM 
spectrum in the three formalisms discussed in the text: peaks 
(equation 1 141 dashed blue), excursion sets (equation [8] dotted 
black) and ESP fcquation ll6l solid red). The dot-dashed green 
curve shows equation JSJ with v — > \/0.7Q7i/; at large v, it is 
quite similar to ESP. As discussed in the text, the dashed 
blue curve for peaks is not well motivated and is only shown 
for comparison with the more appropriate ESP curve. 



masses, /ms lies about an order of magnitude below /esp- 
Although we argued that this is expected (equation 118)1 . 
it is interesting to consider this in view of our remarks 
in the Introduction about the discrepancy between the 
usual excursion set prediction and halo abundances in 
simulations. We noted that to fit halo abundances it was 
common to scale the usual excursion set prediction (i.e. 
/ms) by setting v -» av, with a ~ \/0.707 ( e .g. our Fig- 
ure [1} . The dot-dashed green curve in Figure [2] shows 
that setting v — > \/0.707i/ in equation |[5]| brings it into 
remarkably better agreement with /esp; values of a be- 
tween 0.7 ~ 0.8 also do well. This strongly suggests that 
much of the discrepancy between the usual excursion set 
predictions and halo abundances in simulations can be at- 
tributed to inappropriate averaging in the excursion set 
approach. We show in the next section that this has inter- 
esting consequences for the predicted spatial distribution 
of halos. 



2.5 Excursion set peaks with moving barriers 

Before moving on to the study of predicted halo bias, 
it is worth noting that this approach makes it particu- 
larly easy to see how to incorporate the effects of a scale 
dependent S c . E.g., if we set <5 C — > B(s), then 

1 f°° 

A^espM = — / dx(x -2jo- B')J\f pii (B/ao,x). 

1 U J2- 1 cr B' 

(19) 
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B(s) = 8 + aVs 
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■ MS (a = 0.5) 
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Figure 3. Mass functions with Gaussian filtering of a ACDM 
spectrum for barrier shapes of the type B(s) = S c + cty/s, for 
excursion sets and ESP. The curves show ESP (red) for a = 0.5 
(solid, following from equation 1191) and a = (dashed, equa- 
tion [IB), and the MS result (blue) for a = 0.5 (dot-dashed, 
their equation 5) and a = (dotted, equation [HJ. 



In models of triaxial (rather than spherical) collapse, 
it is a good appr oximation to set B(s) w 5 C + ay's 
[ Sheth et aD . l200ll '). making B/ao = u + a and 2^a B' = 
7«. Figure [3] compares this model with a — 0.5 (see 
Moreno et al. 2009 for why this value is interesting) with 
the case in which a — (B = 8 C is constant). This shows 
that while the shape of f(v) is indeed sensitive to the 
physics of collapse, this sensitivity can only be used as a 
diagnostic if one is confident that the statistical predic- 
tion has been based on the correct ensemble average. 



3 CONDITIONAL MASS FUNCTIONS AND 
HALO BIAS 

MS noted that it was straightforward to extend their 
analysis to make models of the mass fraction in ha- 
los of mass m (corresponding to variance s — uq(R) 
with R oc m 1 / 3 ) which are constrained to lie within re- 
gions of some specified size Ro (corresponding to variance 
So = oo(i?o)) and overdensity So- An explicit expression 
for the conditional mass fraction, which is associated with 
the uncondi tional one in equat ion ((8} has recently been 
provided bv lMusso et"afl |2012i , MPS). 

It is instructive to rewrite their result in the notation 
of BBKS, who provided a similar analysis for peaks. Us- 
ing the dictionary given in Appendix I All equation (28) 
of MPS can be written as 



dv/MPsHfo, So) = Av p - 



(20) 



12-k iv 

with (a;|7,x, ) MS denned in equation and the 



BBKS quantities {l/p,"/^} given in equations (|A1|I , ()A2|) 
and (lA3)l . 

Exactly the same logic as before, when applied to 
the BBKS expression for peaks conditioned on having So 
on scale So (equation Ell of BBKS), leads to 



e-"p /2 V 

dvf ES p(v\So,S ) = Av v ^_ —Go (7, 7#) 



x (^17,7^)1 
7^ 

with ( cc|7, x t } ESP defined in equation (|17|) . 



(21) 



3.1 Halo bias 

MPS argued that a useful way of defining bias coefficients 
for a Gaussian field is to cross-correlate the density of 
the biased tracers with Hermite polynomials in the mat- 
ter overdensity. The former is defined as the ratio of the 
conditional and unconditional mass fractions 



lp h \S } = f(v\6 ,S )/f(v) 



(22) 



and MPS showed that applying this prescription to the 
excursion sets result (equations[20]and [8} leads to closed- 
form expressions for the bias coefficients: 

b n = S- n/2 ^ Ph H n {8o/^S'o)) 

ASoPg{So; So) (ph\5o ) H„(8o/VSo) , 

(23) 



-n/2 



where H n (x) = e x ^ 2 (—d/dx) n e~ x ~^ 2 are the "proba- 
bilist's" Hermite polynomials. 

MPS also showed that these b n have the structure 



f ) £ 



(24) 



where 5 X and e x are given in equation (|A5[) . and that 
the scale-independent (but mass-dependent) b nr could 
be naturally interpreted as bias coe fficients in Fo urier 
space, with conne ctions to the work of lSzalavl (|l988T ) and 
iMatsubaral l|201ll ). They also showed that, at least for 
n — 1,2, the b nr satisfy some remarkable linear rela- 
tions between each other: for fixed n, all the bu r with 
1 < r < k < n can be written as linear combinations 
of bfco, 1 < k < n. More surprisingly, they showed that 
the peaks bias parameters a t linea r and quadratic order 
derived bv lDesiacaues et al.l (|2010T > also satisfied exactly 
the same linear relations between coefficients, although 
the form of the coefficients themselves was different. 

Our results above allow us to generalise the connec- 
tion between peaks and excursion sets bias to all orders. 
Notice that the quantity (ph\So) ~ f(v\So, So)/f(v) is 
given by 



[ph\5o,So) 



df e 



(25) 



" 2 /2/^ Gj(7, 7 i/) ' 

This expression applies to all three formalisms with ap- 
propriate choices for F(x) and J: for excursion sets 
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F(x) = 1, J = 1, while for peaks and ESP we use equa- 
tion (|13[) for F(x), with J = for peaks and J = 1 
for ESP. As a result, the only real difference between 
(Ph|^o } defined for peaks, excursion sets or the ESP ex- 
tension is in the choice of the curvature function F(x). 
This function is independent of <5o and simply goes for 
a ride in the series expansion that defines the bias co- 
efficients. More precisely, the MPS result, that the b n 
are Taylor coefficients of the expansion of ( Phl^o, c = } 
in powers of So (where c is the matrix given in equa- 
tion |A6]), relied only on the properties of the Gaussian 
~ e~"f' 2 pG(i — 7#; f — 7 2 ) and not on the fact that they 
were analysing the special case F(x) = f. This result 
therefore applies equally well to peaks theory and its ex- 
tension. 

This allows us to generalise the results in Appendix 
A of MPS trivially. We show how to do this in Ap- 
pendix [A2] and we find 



Sob n =[^) Yl [ ^(l-ex)***^, (26) 



fc=0 

with the independent quantities fi k and X k defined as 
H k = u h H k (u) ; X k = {-Vu) k (H k {y-Yv)) F , (27) 
~" 1 ~ J>1 and the F-averaged Hermite 



where P^ = 7 /(l — 7" 
polynomial is 



(H k (y-y*)) F 

J °° dyy J F(yy/T)p G (y - Tv; l)H k (y - y,) 
^dyyJF{ yl /T)p G {y-Y V ;l) 

(28) 

ft is straightforward to check that setting F(x) = f, 
J = 1 recovers the results of MPS, while setting J = 
an d using equa tion (If 31) with n — 1,2 recovers those 
of iDesiacaues et all (|2010l ). More interestingly, equa- 
tion (|26(l can be rearranged to write equation (|24[) with 



ftr = (-l) r ^ 



(29) 



The fik are independent of the function F(x), so that it is 
useful to set r = and re-express the ^-dependent A^ in 
terms of the peak-background split parameters b„o. Using 
jUo = 1 = Ao we can write <5"6„o — fi n = J2k=i D n kX k , 
where D nk = Q!)/in-& is an invertible lower-triangular 
matrix with diagonal elements unity, and we find 

Ai = 5 c 6io — Mi , 

X„ = 5™6 n o — /Un — ( L l ™- k ^ k ' n > 1 • 

This explicitly demonstrates how all the b kr for r < k < n 
can be expressed in terms of the b k o with k < n, for arbi- 
trary n, thus generalising the MPS result for n = f,2. 
This algebraic structure is clearly independent of the 
choice of F(x) and J and also holds for peaks theory 
and ESP. Furthermore, the expressions above are ex- 
tremely simple ways of calculating bias parameters at 
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Figure 4. Linear bias predicted by peaks, excursion sets and 
ESP for a Gaussian filtered ACDM spectrum, coded as in 
Figure [2] The cross-correlation in equation I I23H was defined 
on a Lagrangian scale Rq = 0.64 X 40h~ 1 Mpc (at which 
the Gaussian filter encloses the same mass as the TopHat 
filter at 40h~ 1 Mpc). Notice that whereas the replacement 
v -> VoJWfv in the MS mass function improved its agree- 
ment with the peaks and ESP mass functions at large masses, 
the same replacement under-predicts the linear bias at large 
masses. Interestingly, this is qualitatively similar to what is 
seen in TV-body simulations when comparing TopHat filtered 
mass functions and their associated linear bias relations. See 
text for a discussion. 



a ny order (compared, e.g ., to the painstaking calculation 
in IDesiacaues et al]|201Cll for n < 2). 

Figure [4] shows the predicted linear bias for peaks, 
excursion sets and ESP for Gaussian filtering of the same 
ACDM spectrum and with the same colour-coding as in 
Figure [2l with the cross-correlation in equation (|23[) de- 
fined on a Lagrangian scale Ro = 0.64 x 40/i -1 Mpc (at 
which the Gaussian filter encloses the same mass as the 
TopHat filter at 40/i Mpc). All three formalisms pre- 
dict that 5 c bi approaches ~ (S x / Sq)v 2 at large masses. 
For this reason, the modified excursion set result with 
v — > av (dot-dashed green) predicts a linear bias that is 
below the one for peaks and ESP at large masses. This 
is interesting because it is qualitatively the same as what 
has been recently found in A r -body simulations: at large 
masses, an analy t ical m ass function such as the one of 
ISheth fc Tormenl (l999) with a < 1 chosen to match the 
mass function of an iV-body simulation under-predicts 
the linear halo bias measured in the s ame simulation 
ijManera et all . l20I0r . iTinker et all . l20I0h . This suggests 
that an analysis based on peaks theory such as the one 
presented here is on the right track towards obtaining an 
accurate description of both the mass function and the 
halo bias from first principles. 
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4 DISCUSSION 

We showed that, especially if one accounts for corre- 
lations between steps, the standard, spherical-collapse 
based excursion set prediction for halo mass fractions 
(equation [5} vastly underestimates what is measured in 
simulations. However, rescaling the spherical collapse mo- 
tivated self-similar scaling variable v — \ \/0.7v results in 
much better agreement (Figure [TJ. We noted that this 
agreement should not be used to argue that halos formed 
from a spherical collapse - at least, not until the reason 
for the adhoc rescaling of v is understood. 

We then argued that the rescaling was related to 
a flaw i n the usual formu lation of the excursion set ap- 
proach (|Bond et al| [l991). in which one replaces an av- 
erage over spatial positions in one realization of the field 
with an ergodic average over many independent realiza- 
tions of the field. Although we are not the first to have 
noted this problem, much previous work has attempted to 
rectify this by accounting for spatial correlations between 
walks. However, measurements in simulations showing 
that halos form around special po sitions in the initial 
fluctuation field I Sheth et al.l . fioOll ) suggest that it may 
be more productive to instead modify the ensemble over 
which one computes statistical averages. 

We then used peaks theory to illustrate this point, by 
showing how to incorporate the peaks constraint into the 
excursion set formalism. Specifically, peaks correspond to 
regions around which the curvature of the local density is 
modified (BBKS), and this, we argued, modifies the ex- 
cursion set prediction from equation ([8]) to equation (|f 6|) . 
In fact, the fundamental role played by the curvature dis- 
tribution F(x) (eauationlI3|) in our analysis suggests that 
to build an accurate model of halo abundances, all one 
needs is a good model for the initial profile shapes from 
which halos form. For example, one might combine mea- 
surements of the density run around virialized halos with 
infall models to infer what the initial overdensity profiles 
must have been; having found them, one could use them 
instead of F(x) in equation (|I6I) and so predict the halo 
mass fraction f(v). This is in progress. 

Although our analysis has gone some way towards 
addressing the real cloud in cloud problem (correlated 
steps and correlated walks), there is more that can be 
done in this direction. This is because our analysis is 
fundamentally about taking 'one small step' beyond that 
on which the object was defined; therefore, it does not 
correctly account for small objects which are embedded 
in much more massive objects (i.e., when the smoothing 
scales are rather different). Some of th e nicest work in 
this d irection is in a series of papers by ( Manrique et all 
1998], and references therein); we are in the process of 
incorporating their work into our analysis. 

Our formulation of peaks in the excursion set lan- 
guage made it particularly easy to see how to build an 
excursion set model for peaks even when the question of 
which peaks are interesting depends on smoothing scale 
- the analogue of moving barriers in the excursion set 
approach (equation [19] and Figure [3|. This may prove 



necessary if one wishes to incorporate the effects of the 
stochasticity associated with non-spherical collapse into 
the excursion set peaks predictions. 

The similarity in formulation also allowed a simple 
description of how peak abundances are modified if the 
large scale density field is constrained to be different in 
some way (equation [21} . In turn, this allowed a simple 
generalization of earlier results on peak and halo bias 
to all orders (equations I26lf30|) . In particular, we showed 
that excursion set peak bias is most easily understood in 
Fourier space, where it is fc-dependent even at the linear 
level. 

Although we concentrated on an excursion set anal- 
ysis of peaks, the MS 'one-step' argument should ap- 
ply to other traditionally single-scale analyses of cos- 
mological datasets. For example, since the argument is 
not restricted to three dimensional fields, it can be ap- 
plied to interpret the CMB temperature distribution, 
which is a two dimensional (nearly if not exactly Gaus- 
sian) random field. The number density and clustering 
of 'hotspots', as a function of spot temperature, has 
been used as a diagnostic of the Gaussianity of thi s field 
( Bond fe Efstathioul . 1 19871 : iHeavens fc Shethl . Il999h . But 
since some hot spots will be local maxima on larger 
smoothing scales as well, it is of interest to describe 
how the distribution of sizes (and the clustering) of re- 
gions which lie above some threshold temperature de- 
pends on the value of threshold. Clearly, the analysis 
presented here can be applied to that problem directly. 
In three dimensions, perhaps the most interesting con- 
nection and application is to the seri es of recent pa- 
pers on the 'skeleton' of the cosmic web i Poeosvan et all 
2009). This is the subject of ongoing work, where we 
h ope to make a connection to the multi-scale an alyses 
of lAragon-Calvo. van de Weygaert fc Jones] l|2010h . 

Although essentially all the analysis in this paper 
used Gaussian smoothing filters (section 12.11 discussed 
why, in the present context, they simplify the analysis 
substantially), we do not think they are otherwise spe- 
cial, so we are in the process of extending our results to 
include tophat smoothing filters. Since fitting functions 
for halo counts in simulations use tophat filtering (for 
the conversion between a and halo mass) exclusively, un- 
til our analysis does the same, a direct comparison with 
measurements of halo mass functions in simulations is 
premature. 

This is particularly interesting in view of the fact 
that the linear bias factor in our excursion set peaks 
model is close to the usual excursion set predictions as- 
sociated with rescaled S c at small masses, but with no 
rescaling of 8 C at high masses (Figure [4|. This last is 
in qualititive agreement with measurements of halo bias 
in simulations. We believe that matching the enhanced 
abundance and bias at large masses (Figures [2] and [4| , 
without having to rescale the parameter which is asso- 
ciated with the physics of halo formation, are nontrivial 
and encouraging successes. 



© 0000 RAS, MNRAS OOO.rflfTol 



Excursion set peaks 9 



ACKNOWLEDGEMENTS 

We thank V. Desjacques for useful discussions. This 
work is supported in part by NSF-0908241 and NASA 
NNX11A125G. RKS thanks the GEPI and LUTH groups 
at Meudon Observatory for hospitality during the sum- 
mer of 2012. 



References 

Aragon-Calvo M. A., van de Weygaert R., Jones 

B. J. T, 2010, MNRAS, 408, 2163 
Appel L., Jones B. J. T., 1990, MNRAS, 245, 522 
Bagla J. S., Khandai N., Kulkarni G., 2009, MNRAS, 

submitted (arXiv:0908:2702) 
Bardeen J. M., Bond J. R., Kaiser N., Szalay A. S., 1986, 

ApJ, 304, 15 

Bond J. R., Efstathiou G., 1987, MNRAS, 226, 655 
Bond J. R., Cole S., Efstathiou G., Kaiser N., 1991, ApJ, 
379, 440 

Desjacques V., Crocce M., Scoccimarro R., Sheth R. K., 

2010, PRD, 82, 103529 
Heavens A., Sheth R. K., 1999, MNRAS, 310, 1062 
Ludlow A. D., Porciani C, 2011, MNRAS, 413, 1961 
Manera M., Scoccimarro R., Sheth R. K., 2010, MN- 
RAS, 402, 589 
Matsubara T., 2011, PRD, 83, 083518 
Manrique A., Raig A., Solanes J. A., Gonzalez-Casado 

G., Stein P., Salvador-Sole, 1998, ApJ, 499, 548 
Musso M., Paranjape A., Sheth R. K., 2012, MNRAS, 

accepted, larXiv:1205.340T1 
Musso M., Sheth R. K., 2012, MNRAS, 423, 102 
Paranjape A., Lam T.-Y., Sheth R. K., 2012, MNRAS, 
420, 1429 

Pogosyan D., Pichon C, Gay C, Prunet S., Cardoso J. 

F., Sousbie T., Colombi S., 2009, MNRAS, 396, 635 
Press W. H., Schechter P., 1974, ApJ, 187, 425 
Szalay A., 1988, ApJ, 333, 21 
Sheth R. K., Tormen G., 1999, MNRAS, 308, 119 
Sheth R. K., Mo H. J., Tormen G., 2001, MNRAS, 323, 

1 

Sheth R. K., 2011, Pramana, 77, 169 

Tinker J. L., Robertson B. E., Kravtsov A. V., Klypin 

A., Warren M. S., Yepes G., Gottlober S., 2010, 724, 

878 



defined by MPS (they denoted v = dS/ds as 8') is 

Sex 



, -2 . 2-2 

t — 7 = 4sj a 



Va,r(v\v, d ) 



{ v 2 ) Var(f) 
(v\u,S ) 



(Al) 
(A2) 
(A3) 



where 



5 



So 



S' = (v\u, So) = 



2sQ 



V&r(v\v, S ) = 



S x \ Sp 
So J s 



Sx ( r r Sx So 



4r 2 s 



r 2 s S 2 x (l-e x ) 



2 1 



Qs 



s' 2 



(A4) 



with 



[SSo) 



2s 



vS ) 



2^L. (A5) 



Sx d In s 

Related to these is the matrix c discussed in section [3~T1 
which is the quantity one must subtract from the un- 
conditional covariance matrix of the variables (S, v) to 
obtain the covariance matrix of the conditional Gaussian 
p(S, v\5o)- This follows from equation (14) of MPS: 

- = S^\ a ex/2 " 
C sSo [ ex/2 e\ /4s _ ' 

A2 Generalising the MPS results for bias 



(A6) 



For the reasons mentioned in section 13.11 it is straight- 
forward to generalise the results of MPS for halo bias to 
include peaks theory and its extension discussed in this 
work. To do this, we note that the results in their Ap- 
pendices A. 2 and A. 3 only depend on the form of the 
conditional Gaussian distribution p(v, x\So) (they work 
with p(S, v\So)) and not on the fact that their integrand 
of x used F{x) = 1. All that is needed then is to extend 
the results of their Appendix A. 4 to include an arbitrary 
function F(x) and value J in calculating the quantity 
<Ph|(5o,c = 0). With T 2 == 7 2 /(l - 7 2 ), equation (A8) of 
MPS can be generalised to obtain 

(p h |<5o,c = 0) 



APPENDIX A: TECHNICAL DETAILS 

In this Appendix we collect technical details of various 
results quoted in the text. 



Al From BBKS to MPS 

For Gaussian filters, the dictionary for converting be- 
tween (7, v, v p ) and the quantities (S x , e x , S cx , Q, S' , a) 



< / °° dyy J F(yy/T)p G (y - IV+ Spur, 1) 
^AyyJF{ yl lY)p G {y-T V -l) ' 

(A7) 

where we used y = aT/7 and followed MPS in defining 
So = So /8 C and u\ = Fu(Sx/S ){l - e x ). 

This can also be seen more directly as follows. As 
MPS discussed, the condition c = corresponds to the 
assignments 

Q->1; a (2F^y 1 ; S'/a -> (IV - S ui) , (A8) 
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or 

v v -> v(l-5o(Sx/So)), 
1-7 1 -7 , 

7?^7^(l-5o(S x /S*o)(l-ex)). (A9) 

Making these replacements in equation (|25[) gives equa- 
tion (|A7[) . Taylor expanding this expression gives the bias 
coefficients as (p h |5 ,c = 0) = Er=o^"(*" 6 »)/ nL Thc 
expansions of both Gaussians in equation (IA7I) involve 
Hermite polynomials, and lead to equation (I26|) , 
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